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Abstract. We use the grid consisting of bits of 3" to motivate the defini- 
tion of 2-adic numbers. Specifically, we exhibit diagonal stripes in the bits of 
3^ , which turn out to be the first in an infinite sequence of such structures. 
Our observations are explained by a 2-adic power series, providing some reg- 
ularity among the disorder in the bits of powers of 3. Generally, the base-p 
representation of fc^ has these features. 



1. Several mysteries 

The binary representation of a number m can be thought of as encoding the 
unique set of distinct powers of 2 that sum to m. For example, 

81 = IOIOOOI2 = 1 • 2^ + • 2''^ + 1 • 2"^ + • 2^ + • 2^ + ■ 2^ + 1 ■ 2° = 2^ + 2"^ + 2". 

We will display binary representations graphically by rendering and 1 respectively 
as □ and H. For reasons that will become clear, the convention in this paper when 
displaying binary representations graphically is to reverse the order of the digits 
relative to the standard ordering, so that higher indices are to the right. For 
example, we write 81 = ■ I I ■ ■ = 2" + 2"* + 2'', where the dot identifies the 2° 
position (somewhat like a decimal point). 

The binary representations of the first several powers of 3 grow steadily in length: 

3° = i 

32 = 

33 = 



3^^ = ■"TT 



Figure [T] displays a grid in which the nth row contains the binary digits of 3". 
Pictures like this were considered by Stephen Wolfram [U page 119]. Small triangles 
and other local features can be seen, but overall we get an impression of uniform 
disorder. There is no global structure evident, aside from the right boundary of the 
pattern, which has slope log2 3. 

There are some global regularities, however. In particular, every column is even- 
tually periodic. This is because there are only finitely many (in fact 2") states 
that can be assumed by the first a columns taken together, so eventually the first 
a columns return to a state that they have reached before, at which point they 
become periodic. 
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Figure 1. Powers of 3 in base 2. The nth row consists of the 
binary digits of 3", in order of increasing exponents. 



In fact, each column is not just eventually periodic but periodic from the start. 
This is because each row has a unique predecessor, namely the integer obtained by 
dividing by 3. Put algebraically, 3 is invertible modulo 2" for every a > 1, so from 
a given row we may compute the previous row to as many bits as we want. 

What if we try to compute a bits of "row —1" — the predecessor to the initial 
condition? Certainly we can do this, and the result simply maintains the periodicity 
of the columns. We can iteratively compute predecessors and thereby uniquely 
continue the picture up the page. Figure [5] shows the end of the unique infinite 
"history" leading up to the initial condition. 

For those readers familiar with cellular automata, we mention that the ability 
to evolve the system backward in time is analogous to the same ability in a class of 
cellular automata whose local rules are bijective functions in the rightmost position. 
As with multiplying each row by 3 to form the next, information only propagates 
to the right and is not lost in these automata, and consequently they are reversible 
under the condition that the left half of each row is determined (say, all white) [3] . 
Indeed, several themes of the present paper can be carried over to such cellular 
automata. 
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Figure 2. Part of the history obtained by periodically continuing 
each column up the page. 



Regularity versus complexity in the binary representation of 3'^ 3 

But what do the bits in such rows mean? Rows n < in the history illustrated 
in Figure [2] do not represent integers, since they contain Is in positions arbitrarily 
far to the right: The sum used to compute the value of such a row diverges. For 
example, row —1 represents the "infinite integer" 

• • • ■ • ■ = 2" + 2^ + 2^ + 2^ + 2^ + 2^ + • • • 

OO 

= 1 + ^22^+1. 

1=0 

However, formally applying the geometric series formula to this divergent series 
produces 

1=0 

which is certainly a natural object for row rt = — 1 to correspond to. Similarly, row 
—2 represents 



. . NIMBI I I I I I ■ • ■ = 2° + ^ (2S*+3 + 2^'+^ + 2' 

i=Q 

2-^ 24 



6i+5\ 



1 



1-26 1 _ 26 1-26 



= 1 = 3- 
9 

This is our first mystery, and in fact for each n < there is a divergent series which 
produces 3" under invalid applications of the geometric series formula. 

Mystery 1. Each power 3" for n < is the "sum" of a divergent series. 

In order to resolve this mystery we must first encounter several additional mys- 
teries — all related to the first — regarding the binary representation of 3". 

Since there are only two cell values (□ and ■) in Figure [1] the period length of 
each column is a power of 2. (In fact for a > 3 the period length of 3" mod 2" 
is 2°~2.) Therefore, another consequence of the column periodicity is that row 
2" resembles the initial condition in several bits, since the periods of the first 
several columns will have just started over. Brenton Bostick brought this "local 
nestedness" to my attention at the Midwest NKS Conference in 2005. For example 
3^ = 81 = ■ZnHZBZH agrees with the initial condition 3^^ = M I I I I "H • • • to three 
places to the right of the 2° position. Other terms in the subsequence 3^ agree 
with the initial condition to more places: 

3^^ 



I I I I ■ ■ ■ I I I ■ I 



Figure [3] shows many more rows, each row truncated at 600 bits; the image can be 
efficiently produced with the following Mathematica code. 
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Figure 3. The subsequence 3^ . The nth row contains the first 
600 bits of 32". 



ArrayPlot [ 
Reverse /(§ 

IntegerDigits [PowerMod[3, 2"Range[0, 255], 2~600] , 2, 600] 

] 

The large triangular region of white cells indicates a sort of convergence to the 
initial condition: The farther down we go in this image, the more columns have 
stabilized to the first bit in their period — the bit in row 0. As Figure [3] shows, 
each column (except the leftmost column) eventually becomes white, because row 
is simply • • ■ I I I I M I I I I I ••• = 1 = 3". In other words, 3^ "converges" bitwise 
to 1 as n — > 00. 

This can be proven by Euler's theorem, which states that if k is coprime to b then 
/j'/'C') = I mod 6, where the Euler totient function (j}{b) is the number of integers 
I < X < b that are relatively prime to b. It is not difficult to convince oneself that 
if p is prime then (j){p"-~^^) = . = (p — In our case, letting fc = 3 and 

b — 2"+^ gives 3^" = 1 mod 2"+^. Letting n ^ 00 shows that every bit in 3^" 
eventually approaches the corresponding bit of 1. 

Perhaps we feel a little uneasy about giving much credence to this convergence, 
because certainly 3^ gets very large and far away from 1 as n gets large. Thus we 
record it as another mystery. 

Mystery 2. hm 3^" = 1. 

n— >oo 

In Figure [3] we see additional structure as well — surprising diagonal lines above 
the white triangular region. More diagonals are filled in as we go down the page, 
so there appears to be another bitwise-convergent sequence here. To change the 
diagonal lines into vertical lines, we make the first column white (for uniformity) 
and shear the image (shifting each row one position left relative to the row above 
it). The result is shown as Figure SI Indeed these (shifted) rows are converging 
bitwise to something — the row 



€! = ••• J ■ 



I I ■ ■ I ■ I ■ ■ I I ■ I I 



Regularity versus complexity in the binary representation of 3 



5 



j^i: r.-./;.; -.-'.jVi' .■>.?!: 



^^v;>.?r:^,V'-:.i-,y'.f;:. 



. 1. ;v:, . 



■■ "^^ ■: ■ ^ "^"^ .- "--v -^r"':.' ■ ^- ■ "-Jf- '.^^ .--/.•".I-** ■/■ ■ —'-'-1 ^" .1 

Ifr^i--- ' . ■'^ O- ■■.^>:;^.r:^'>,;^^■■.:■-.■:f^■>,■■;■.■.>■ f>:j,--'Vr-i '' ;.v^ 
■. ■■^■^:v^^l^vi>'^.l>■■.:;:■:*.:^^■^v^^^^^ vi-^^ 

Hill I 



Figure 4. Bits of 3^ , sheared so that the diagonal hnes are now vertical. 



The shearing operation can also be effected by shifting the nth row left by n bits; 
in other words, divide the nth row by 2". We may therefore write 



Cl 



lim ■ 

n^oo 



1 



2" 



In Figure |4] we also observe some secondary diagonal structures that were not 
easily visible before. They are not as demarcated as the first set and seem to be 
interacting with the complex background. In order to make the secondary diagonals 
vertical we would like to perform the same shearing operation. However, first we 
need to subtract the limiting pattern ci from each row. But subtract it how? The 
limit is a divergent "infinite integer" , but forming an integer from the first a bits 
of Cl and subtracting this integer from each row clears all the corresponding equal 
bits. Once we have subtracted the limit, we divide by 2" to remove the n bits of Os 
on row n. This produces Figure [SJ in which the secondary diagonals are no longer 
muddied by the background but produce a clear limiting pattern themselves. The 
new limit is 



3^ -1 



C2 



lim ■ 

n— ^oo 



Cl 







It is natural to let cq — lim„_j.oo 3^ = 1 be the first limit. If we continue to 
iterate this subtract-and-shear operation we continue to find convergent sequences 
of rows. This means that, despite the apparent complexity in bits of 3^ , every 
region can be decomposed into a sum of simple periodic regions. 

The next limit 



C3 = • • • J I I r 



satisfies 



lim ■ 



C2 



C3 =0. 



Let us take this expression and unravel it to see the structure better. We find 



3' - (C0 + Ci2"+C222" + C32-^") 
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Figure 5. Bits of ((S^" - l)/2" - a) /2", obtained from 3^" by 
twice subtracting the limit and shearing. 



as n — >■ oo. Replacing 2" with x reveals that this is a power series: 

3^ = Co + cix + C2X^ + c^x^ + 0(a;''). 
Of course, we know a power series for 3^, namely 

so we might conjecture that a = (log3)V*!- 

For i = we indeed have (log3)°/0! = 1 = cq. But for i = 1 the conjecture 
seems to fail, because ci is not a real number but an "infinite integer". (In any 
case, the bits of ci don't resemble the binary representation of the real number 
log3 = 1.00011001001111101010 • • -2.) 

Mystery 3. 'log 3" is not the real number log 3. 

A final observation we make is that the direction of convergence is opposite 
that of real numbers. All the sequences we have seen approach their limits by 
filling in bits from low indices to high indices, which is toward the right in our 
graphical convention of reversing the digits of integers. A convergent sequence of 
real numbers, on the other hand, fills in bits from high indices to low indices. Take 
the sequence (1 + 1/n)", for example. Some terms of this sequence (as real numbers) 
are shown in Figure [SI The convergence proceeds from left to right, which is the 
same graphical direction but opposite numerical direction as the convergence of the 
sequence 3^" in Figure |31 

In the setting of bits of 3^ , then, the low indices of a number are somehow 
stronger than the higher indices. Therefore we should really think of the "tail" of 
numbers as being backward from the normal sense: In this mode of convergence, 
two numbers are close to each other if their leftmost bits agree — if their difference 
is divisible by a large power of 2. This is why we have chosen the convention that 
higher indices are to the right. 

Mystery 4. Two numbers are close if their difference is highly divisible by 2. 
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Figure 6. Binary representations of (1 + 1/n)" as real numbers, 
with most significant bits on the left. The terms are slowly con- 
verging to e = 10.10110111111000010101 • • -2- 



2. 2-ADIC NUMBERS 

Our four mysteries suggest that there is a notion of number presenting itself 
through the binary representation of 3^ that is quite different from the real num- 
bers. From Mystery |4] we must conclude that in some sense 2* gets small as i gets 
large, and the other mysteries support this conclusion. Let us therefore make this 
a definition instead of a mystery and introduce a new notion of "size" to make this 
precise. 

Every rational number r ^ has a representation r = 2"^ for integers a, n, and 
d, where n and d are not divisible by 2. Moreover, a is unique. We want |r|2 to be 
large when a is small and small (but positive) when a is large. A natural choice 
is to let \r\2 = 2~"; this is called the 2-adic norm of r. For example, |64|2 = 1/64 
and I — 691/2730|2 = 2. Since is very highly divisible by 2, let us define |0|2 = 0. 

Since large powers 2* are small in the 2-adic norm, a rational number can be a 
sum of arbitrarily large powers of 2 when thought of 2-adically, just as it can be 
a sum of arbitrarily large powers of 1/2 when thought of as a real number. For 
example, 

oo 

3 ^ 

i=Q 

In fact, every rational number has the representation J^I^n '^i^' f*-"" some integer 
N and Ci € {0, 1}. For example, —1 is rendered 2-adically as 

^ oo 

-1 = = V 2* = 2" + 2^ + 2^ + 2^ + • ■ • ■ • • , 

1 - 2 ^ ■ 

1=0 

which illustrates that the 2-adic representation of a number is really a "limit" of 
its representations modulo 2°" as a ^ oo. For finite a, the representation modulo 
2° of course coincides with its two's complement representation. 

In general, a bits of the 2-adic representation of a rational number r = 2" can be 
found by computing the inverse d~^ mod 2°", which is an integer, and multiplying 
by 2"n. 
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One can check that the 2-adic norm induces a metric d{x,y) — \x ~ y\2 on 
the rational numbers, akin to the usual metric induced by the absolute value. In 
particular, it satisfies the triangle inequality \x — y\2 + \y — z\2 > |a; — z\2. 

There are some strange properties of this metric, however. Perhaps the most 
immediate is that every triangle is isosceles: If |a; — y\2 = \y — z\2, then the triangle 
is isosceles by definition. On the other hand, ii \x — y\2 =/= \y — z\2, then 

\x - z\2 = max(|x - y\2, \y - ^b)- 

For example, ii x — y = 20 and y — z ~ 6 then |x — y|2 = 1/4 7^ 1/2 = |y — z\2 and 
\x-z\2 = \26\2 = 1/2. 

Naturally, we write a;„ — a; if the sequence of 2-adic norms \x — Xn\2 approaches 

as n — i> 00. So indeed 3^" — > 1 in the 2-adic metric. 

Of course, when we take a limit of rational numbers we may not get another 
rational number. Traditionally, the real numbers can be constructed by taking 
limits of rationals with respect to the real metric; each real number has an expansion 
Si^iv Ci2~* for Ci e {0, 1}. Similarly, we can take limits of rationals with respect to 
the 2-adic metric and get a different completion of the rationals. This completion 
is called the set of 2-adic numbers, and each 2-adic number has a representation 
Si^jv Ci2*, where again a G {0, 1}. Like the real numbers, this set is complete — 
it contains all its limit points. 

It turns out that our power series 3"^ = X]i^o(^*^S ^)*-^V^' is correct, but it must 
be interpreted not as a real power series but as a 2-adic power series. This means 
that "log 3" is not the real number log 3 but the 2-adic number log 3. How do we 
compute it? The function log(l — x) has a 2-adic power series that coincides with 
its real power series: 

00 ^ 

iog(i-:.) = -ET- 

i=l 

Of course, in the real metric this power series diverges at x = —2, so it cannot be 
used to compute the real log 3. But 2-adically this series converges at x = —2 to 
the 2-adic ci = log 3. Similarly, C2 = (log 3)^/2!, C3 = (log3)'^/3!, and so on. 

To be precise, one must of course establish the standard objects of calculus over 
the 2-adic numbers — derivatives, power series, tests for convergence, etc. We do 
not undertake this task here but refer the reader to texts on the subject. The 
book of Gouvea [1] serves as a solid introduction, and Koblitz 2 provides a more 
advanced treatment. 

3. Generalizations 

The results of the previous section can be generalized in several directions, and 
we discover that the power series structure we have seen is quite common. 

Euler's theorem tells us that there is nothing particularly special about 3^ , and 
in fact 5^ and 7^ have exactly analogous structures in binary, as shown in the 
first row of Figure [T] In general, if k is odd then has a 2-adic power series 

1 + xlogk-\ . Gouvea fTI, Section 4.5] discusses the region of convergence of such 

power series. 

What about other bases h > 21 The second and third rows of Figure [7] show 
several examples. To address these cases we briefiy generalize the discussion to 
p-adic numbers for prime p. 
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Figure 7. Powers in base p. 

Of course we may define \x\b for general b (prime or composite) in the analogous 
way. For primes p, \x\p is a norm on the set of rational numbers. For composite b 
it is not since in general \x ■ y\b 7^ \x\i, ■ \y\i,; for example, |4|4 = 1/4 7^ 1 = |2|4 • |2|4. 
In fact, it is a theorem of Ostrowski that the p-adic norms and the real norm are 
(up to equivalence) the only nontrivial norms on the set of rational numbers. 

Evidently 4"^ ^ 1 in the 3-adic metric. However, the 3-adic limit of 2^ is not 
1 but 2 + 2-3^ + 2-3^ + -- - = —1. For general k relatively prime to p, Euler's 
theorem provides that k^P~'^'>p" = 1 mod p""*"^. In the limit, then, fc*" approaches 
a (p— l)th root of unity \^/^p^^) . This root of unity is congruent to k modulo p and 
is called the Teichmiiller representative of k. This accounts for the vertical stripes 
in the base-5 digits of 2^ and 3^ ; the 5-adic fourth roots of unity congruent to 2 
and 3 modulo 5 are irrational. Note also that 2^ + 3"'' — > 0. 

The p-adic power series of functions f{x) other than k^ are also evident in the 
base-p digits of /(p"). Let F„, C„, M„, and S„ be the sequences of Fibonacci, 
Catalan, Motzkin, and Bell numbers. The sequences C2" and have 2-adic 
limits. The sequences and i?2" do not have 2-adic limits, but i^22", ^22"+!, 
B22n , and B22n+i do, giving some indication of the ubiquity of p-adic convergence 
in combinatorial sequences. 

Finally, consider the factorial function xl. The terms of the sequence 2"!, of 
course, become highly divisible by 2, so 2"! — >^ in the 2-adic norm. However, a 
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theorem of Legendre implies that |2"!|2 = 1/2^" ^, and it turns out that 

2"! 

has a (nonzero) 2-adic hmit. 
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